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ABSTRACT and CONTENTS 


This 1s the reference manual for the BCC 50@ System Programming 
Language (SPL). The syntax and semantics of the language are defined. 
This reference manual is an edited and updated version of a working paper 


written by Butler W. Lampson. 
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1. Introduction 


The BCC-500 Systems Programming Language (SPL) was designed - 
for use in systems development, translator writing and high level 
sub-system deve lopment. It is sufficiently flexible and generates 
sufficiently efficient object code to replace assembly language in 
almost all situations. SPL is highly interactive, simple to use from 
a terminal and helpful in error situations. 

This reference manual defines the syntax and the semantics of 
the language. Details about the command language, the editor and 


debugger will appear in another manual. 


2. Meta Language 


A syntax equation describes the form of some construction in © 
a language. For example, the following equations describe 
Simple arithmetic expressions. 

expression = ["+/"-"] term $(("+"/"-") term) ; 


primary $(("*"/"/") primary ); 


term 


primary symbol / number / "(" expression ")"; 

The first equation reads “an expression has the form of an optional plus 
or minus followed by a term followed by an arbitrary number (including @) 
of the constructs (denoted by '$'), either a plus or minus sign followed 
by a term." The last reads “a primary has the form of 5 Svnbel or a 
number or a literal left parenthesis followed by an expression fol lowed 
by a right parenthesis." Symbols such as 'primary', ‘term' and ‘expression’ 
are defined above and are called non-terminal symbols. 'Symbol' and 


‘'number' are not defined above and are, therefore, called terminal symbols. 


The construct "(" is an example of a literal. 


2.1 Syntax for Meta Language Equations 


equation = symbol "=" alternation "3"; 


alternation = catenation $("/" catenation); 


catenation = term $term; 
term = primary / optional / repetition / negation; 
primary = symbol / literal / "(" alternation ")" / 


Hell text Ws i : 


optional = "(" alternation "]"; 
repetition = [integer] "$" [integer] primary; 
negation = primary "-" primary; 


The alternation indicates that any alternative is legal; a catenation 
indicates that the terms follow each other. The last alternative of 
the definition of a primary is included to allow an “escape” to normal 
English text. Naturally the text may not include a ">" character. 


This convention should be avoided if possible. 


An ‘optional’ indicates zero or one occurrence of the alternatives 


enclosed in brackets. The meaning of the four kinds of repetitions are: 


Form | _Meaning 
$ something arbitrary number of ‘something's’ 
1$ something one or more ‘something's’ 
$6 something zero to six 'something's' 
2$7 something two to seven ‘something's’ 


The negation construct is used to make an exception. It translates 
“any occurrence of the first primary except for an occcurrence of the 


second primary." 


2.2 Syntax of Meta Language Tokens 


Many languages are built out of small units, called tokens. The tokens 

of the metalanguage are symbols, integers, literals and certain punctuation 
characters such as "/", "$" and ";" (see below). In such languages blanks 
and carriage returns may appear between tokens without affecting the 
meaning of the information. The usual rule is that blanks and carriage 
returns delimit (or separate) tokens, but are otherwise ignored. 

Unless the contrary is explicitly stated in our documents, the reader 


May assume the treatment of blanks is as described above. 


These equations define the tokens: 


symbol = word $(":" word); 
word = letter $(letter / digit); 


integer = digit $digit; 


literal — (tut $charl bus / Hoan $char2 ee 
charl = character - '"'; 
char2 = character - "'"; 


A symbol is a sequence of words separated with colons. A word is at least 
one letter optionally followed by letters or digit. An integer is 
represented in the normal way. A literal is a string of either charl's 
enclosed by double quotes or char2's enclosed by single quotes. The 

first form is preferred. ‘charl' is any character except double quote; 
‘'char2', any except single quote. Note that "?" "." is not the same 
as "2!" The first form implies that the Ho" ang "i" may be separated by 


blanks, while the second requires that the two characters appear 


consecutively with no intervening blanks. 


These three 


tokens. 


Character = 


letter = 


digit = 


equations define the basic set of characters for defining © 


letter / digit; | 

a a ae ae ee lO 
ae OY ae A ey Sy A A a ea A 
a eee ee Pe eg ee a hd) Kae ted 
By a Ao av le a gy at Nay A AV LA a Ge A 
GPU aS TS Pera Pe, er repo: 


3. Scopes and Program Format 


An SPL program is organized into blocks. A block begins 
with a PROGRAM or COMMON statement and ends with an END 
Statement. It has a name which is given in its initial 
Statement. Block names must be unique over the entire 


program. Thus the general format of a program is: 


program = $block; 

block = common:block / program:block; 

common: block = "COMMON" identifier ";"_ 
block: head 


end:statement ";"; 


program: block "PROGRAM" identifier ";" 

block: head 

$($(label ":")_ 

action:statement ";") 

end:statement "3"; 

block: head = $(include:statement '";") 
$(allocation:statement ";") 
$(declare:statement "3;"); 

allocation:statement = fixed:statement / 
origin:statement; 

label = identifier; 


identifier = name; 


1] 
Thus, statements must occur in a block in the order: 


PROGRAM or COMMON statement 
include:statements 
allocation:statements 
declare:statements 
action:statements 


end:statement 


A common:block must precede any blocks which INCLUDE it. 


3.1 Lexical format 

Every statement ends with a semi-colon. 

Carriage returns and blanks are treated according to 

the following rules: 

1) inside string constants or character constants 
blanks are treated like ordinary characters. 
Carriage returns are illegal in string and 
character constants (unless written with the '8' 
escape convention). 

2) elsewhere a string of carriage returns and blanks © 
is equivalent to a single blank. 

3) a blank may appear anywhere except in the middle 
of a token. Tokens include names , reserved words, 
constants, the sequences Neel Npat Ween O77 that 


are SPL two character operators. 


To summarize these rules somewhat sloppily we say that carriage 
returns and blanks are ignored except in string constants, — 
names, and reserved words. 
A comment has the form: 
comment = <carriage return> "*" <arbitrary 
string of characters not including 
carriage return> <carriage return> / 
"/*" <arbitrary string of characters 
not including "*/" or carriage return> 


("*/" / <carriage return>) ; 


The first form of comments is exactly equivalent to a 

carriage return. The second form is equivalent to a blank 
if it ends with "*/", a carriage return if it ends with a 
carriage return; the difference is apparent only if it is. 


immediately followed by "*". 


Note that a multi-line comment must have an * or /* at the 


start of each line. 


3.2 Scopes 


Each variable is declared in some block and is said to be 
local to that block. The same identifier may poner to two 
different variables which are local to different blocks. 
The variable name together with the block name, however, 
is sufficient to identify the variable uniquely. A 


variable is said to be LOCAL in scope if it is local to a 


program block, COMMON if it is local b6-4 common block. 
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Function names (i.e. names which appear immediately after 


FUNCTION or ENTRY) are GLOBAL in scope, however. 


A variable may be referenced only in a block in which it is — 
defined. Any variable is defined in the block to which it is 
local. Suppose that block € includes COMMON blocks Bi 

(i=1,...,n) in that order. Then a variable defined in Bj 

is also defined in C unless it is local to C or defined in 

Bi, i>j. A block includes Bi if Bi appears in the identifier: list 
of an include:statement in the block. | 


include:statement = "INCLUDE" identifier:list; 


identifier: list identifier $("," identifier); 
All GLOBAL variables are considered to be defined in a GLOBAL : 


COMMON block which is considered to be included in every block. 


The effect of this convention is that declarations in 
COMMON blocks can be overridden by other declarations nearer 

the point of use. Exception: a MACRO name cannot be overridden. 
Note that if B includes A and C includes B, then the variables 
-Jocal to A are defined in C (unless variables of the same 
name are local to B or C). A declaration overriding an INCLUDE 
must occur before any reference to the variable involved. 


See figure 1. for an illustration. 
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Figure 1: Determining Defined Variables 
(In the following figure <F>P means P local to F) 


GLOBAL variables 


Defined: <F>P,U,<G>V 


COMMON A; COMMON B; 
DECLARE P,Q; DECLARE R,S,;3 
Defined: <F>U,<G>V,<A>P,Q Defined: <F>P,U,<G>V,<B>R,S 


COMMON C; COMMON D; 
INCLUDE A; INCLUDE A; 
DECLARE M,N,P; DECLARE I,JU,M; 


Defined: <F>U,<G>V, Defined: <F>U,<G>\V, 
<A>Q,<C>M,N,P <A>P,Q,<D>I,J,M 


COMMON E; 
INCLUDE B,D; 
DECLARE J,R; 


Defined: <F>U,<G>V,<B>S, 
<A>P,Q,<D>1,M, 
<E>J,R 


PROGRAM F 3 PROGRAM G; 
INCLUDE C,D; INCLUDE E; 
DECLARE L; DECLARE JK; 
ENTRY P; ENTRY V3 
ENTRY U; Defined: <F>U,<G>V,J,K,<B>S, 


Defined: <F>L,P,U,<G>V, Sees Ee 
<A>Q;,<D>I 5d 5M,<C>N 


1S 


4. Allocation 


SPL has a considerable amount of machinery for controlling 
the allocation of storage for programs and data. Much of 
this machinery is of limited interest, but a few parts of 
jt are important to nearly all programmers. This section 
dpseieees the topics of general interest first, before 
going on to the others. The reader is advised to break 


off when he encounters material of no relevance to his needs. 


4.1 Permanency of Storage 


Data in SPL is of two kinds: permanent and stacked. 
Permanent data stays around for the life of a program, i.e. 
the value of a permanent data item, once set, survives unti | 
explicitly changed by the program. All data declared in 
COMMON blocks is permanent. Data declared in PROGRAM 


blocks is permanent if the block includes a 
fixed:statement = "FIXED" ["," "ORIGIN" expr]; 


The function of the ORIGIN clause is explained in Section 4.3. This 
Statement, if it is present, must appear between the include:statement 
and the declare:statements of the block. The FIXED 

program block may not be entered recursively (by funtion calls) 


during execution. This error is not checked for. 
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If a program block ig not FIXED, all the variables local to 

it are stacked. This means that their values are undefined 

when the block is entered (by a call to one of its functions), 
may become defined by the action of the program and disappear 
when the function returns. The block may be entered recursively, 
and the values of its local variables for each level 


of recursion are completely distinct. 


4.2 Layout of Core 


The arrangement of memory relative to G' (the global environment) 
is designed to group read-only objects together 

and on separate pages from writeable things, so that the 

former can be protected by the hardware from modification. 

Later improvements will permit small programs to be packed 


together better. 
Space is allocated in four main regions 
G':WGS> «RSGS:G'+40000B : CS> i :377777B | 


WGS: Writeable Global Storage, starting at G'. This area is 
allocated by a general storage allocator in the compiler in a 

piecemeal fashion: no attempt is made to keep related things together. 
All the writeable variables which appear in common blocks, together 
with fixed local environments are put here. 


Some of the first 128 words may also be used for field and array 
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descriptors, at the discretion of the compiler, except in 

the monitor ring, where this will never be done (unless forced 

by equivalences). The first few words, of course, are used for 
objects whose location is fixed by the hardware, like the stack 
descriptor. The allocation strategy for this area may be modified by 


ORIGIN statements; see below. 
Collision of this area with RSGS is a fatal error. 


The stack (where stacked data is stored) is allocated space at 
the end of this area. Its size depends on the number of 


non-FIXED PROGRAM blocks entered but not exited. 


RSGS: Read-only Scalar Global Storage. Here are put the constant 
scalars (e.g. array descriptors and initialized scalars) 

from common blocks, as well as function descriptors. This area 

is allocated by another incarnation of the general storage allocation 


used for WGS and on the same piecemeal basis. 


CS: Code Storage. Space here is allocated by block. All the 

code and constants generated by one program block, or all the 
non-scalar constants (strings, arrays, etc.) generated by 

One common block, are collected together and allocated contiguous ly 
in that region. Transfer vectors also appear here. If block A 


precedes block B lexically (in the source), then the 
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CS for A will precede the CS for B. 
4.3 Origins 


The origin:statement permits (most of the) storage of a block 


to be allocated at a fixed place. 
origin:statement = "ORIGIN" [expr]; 


The expression, whose value is called the origin of the block, 
must evaluate to an integer at compile-time. ~The statement | 
must appear in the block after any include:statement and before 


anything else. 


If the block is a program block or a common block with no 

writeable variables deSlared..he origin tells where to stare. 

its space in CS. If the space for the preceding block in CS extends 
past the specified origin, an error is recorded and the statement 

is ignored. This implies that origined blocks must appear © 

in order of increasing origins. Note that the scalar storage ~ 

of a common block is allocated in RSGS and is not see etted by 


origin:statements. 
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If the block is a common block with writeable storage, then the 
origin tells when to start this storage. Two restrictions 
apply 
1) The block must have no requirements for CS. 
2) All blocks with origined WGS must appear before any 
non-origined blocks which require WGS, so that the 
Space taken by origined blocks can be properly 


removed from the control of the storage allocator. 


All the WGS for an origined block is allocated together. A 
Subsequent block may omit the expr from its origin:statement, 
in which case its WGS is allocated immediately following that 


of the preceding block. 


4.4 Fixed Environments 
The location of a fixed local environment may be specified by 


the fixed:statement, thus: 
FIXED, ORIGIN expr; 


The origin clause tells where to put the ar vaonient. The programmer 
is responsible for the security of the area he chooses, 

which is not checked by the compiler. In the absence of the 

ORIGIN, the compiler will allocate the storage in WGS at its 


discretion. 
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4.5 Equivalence 


An equivalence can be used to fix the location of a scalar or 
an array descriptor by writing an integer-valued expression 


for the object of the equivalence. Thus 
DECLARE A = 40B, ARRAY B[30] = 41B; 


allocates A at 40 and the descriptor for the array B at locations 

41] and 42 octal. The array itself ich ented according to the default 
rules. Restriction: the value of the equivalence must be in 

the range [G' ,G'4+37777B]. An equivalence overrides all shee | 

methods of storage allocation. If a variable V has been equivalenced 

to a constant, or is declared in a common block, then 


@V is a constant whose value is the address assigned to \V. 
4.6 Fixed Fields 


Descriptors for part-word fields are normally allocated in the 
first 128 words of the global environment by the compiler if 
there is room. This allocation can be suppressed and the | 
field allocated in the function or common block like any ather 


constant by prefixing FIXED to [SIGNED] FIELD in the declaration. 
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5. Declarations 


5.1 Names 


A name 1s a sequence of not more than 16 characters starting 
with a letter, each of which must be either alphanumeric or 


an ' (apostrophe). 


5.2 Declaration Statements 
A declaration consists of a list of names together with 
Specification of scope, type and mode, and possibly 


of initialization and equivalence. Thus: 


declare:state- 
ment = "DECLARE" declare:clause:list / 


macro:statement; 


declare:clause: 


list = declare:clause $("," declare:clause); 
declare:clause = [type] [mode] item; 
j tem = identifier [form / [dimension] / [length]] 


[equivalence] [initialization]; 


equivalence = "=" (jdentifier [subscript:list]/ 
("L'" / "G'") subscript: list / expression); 
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Subscript: list = "[" expression $("," expression) pee 
initialization = "<" (expression / "(" expression 


Se expression) any \ 


Where L' means local environment and G' means the global environment. 
A declaration is processed from left to right. The attributes — 


are initialized as follows: 


scope - is determined by whether the declaration is ina 


PROGRAM block (LOCAL) or a COMMON block (COMMON) 
type = INTEGER 


mode - SCALAR 


The values of the three attributes are called the state of 

the declaration. Occurrence of attribute specifiers may change 
the state. A name is given the attributes which are in the 
State when it is encountered, except that the form, dimension, 
or length, if appropriate, may follow the name as indicated 

in the syntax of item. FUNCTION, FIELD, and ARRAY are taken 

as specifying mode unless immediately followed by a mode word, 
in which case they specify type. Occurence of a type word | 
sets the mode to SCALAR; occurence of a mode word leaves the 


type unchanged. UNKNOWN SCALARS are not allowed. 
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EXAMPLE: 
DECLARE INTEGER A, B, STRING C, ARRAY D, E [5], 
ARRAY [10] F, G (5:12); 
declares integer scalars A and B, string scalar C, string 
arrays D, E, F, and G. The array D is not assigned any 
storage, but E is assigned 5 elements and F and G get 10. 
Except for G, no space is assigned for the spine values and 
all the strings have 8-bit bytes; each element of G is 


assigned space for 5 bytes at 12 bits each. All these things 


are local. | 


5.3 Attributes 


24 


Every name has three attributes: scope, type, and mode. Each 


is chosen from a fixed set of alternatives: 


scope = 
type = 


ptype = 


~ ntype = 


integer = 


mode = 


Note that if the type 


"COMMON" / "LOCAL" / "GLOBAL" ; 
ntype / ptype / "STRING" [length] ; 


"PARAMETER" integer / "PARAMETER REAL" / 
"PARAMETER DOUBLE" / "PARAMETER COMPLEX" ; 


integer / "REAL" / "DOUBLE" / "COMPLEX" / 
"LABEL" / "LONG" / "LONGLONG" / 
"FUNCTION" / "FIELD" / "ARRAY" / 
"UNKNOWN" ; 


"INTEGER" / "OCTAL" / "CHARACTER" / 
"POINTER" 3 


"FUNCTION" / ["FIXED"] ["SIGNED"] "FIELD" [form] / 
("ARRAY" / "ARRAYONE") [dimension]/ 
"SCALAR" 


is not specified "INTEGER" is assumed. 
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Certain constructions permitted by the above syntax are forbidden 


because no reasonable meanings can be attached to them. 


1) Objects of type ARRAY or STRING with mode FUNCTION 
or FIELD do not need dimensions and lengths, and 


to give them as part of the item is an error. 


2) A form may appear only if the mode is FIELD, a 
dimension only if the mode is ARRAY, a length 
only if the type is STRING. 
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The type of scalar value determines its size: ‘integer, 

function and field are one word values; long, real, array, and 

label, are two words each;. string, double, longlong, and complex, 

are four words each. An array is represented by a two-word descriptor, 
as is a label scalar. A function scalar is represented by a pointer 

to the two-word descriptor. A field scalar is either a 

constant, if its form is specified, or occupies a single 

word. The four cases of integer are included to permit 

jee tituent printout of the value during debugging. The 

compiler recognizes only one type of integer, and the others will 


not be mentioned again. 


If a name has mode ARRAY (or ARRAYONE; they are identical 

except that the latter causes subscripts to start at 1 

rather than @), subscripted references to it will be enn Te 

on the assumption that indirection through the descriptor 

will produce the effective address. It is also possible to subscript 
INTEGER SCALARS; such references will add the value of the name to 


the subscript to produce the effective address. 


If a name is a field without a form, tailing (".", ngu or 
"@") will cause indirection through the location allocated 


to it. If it has a form, it is treated as a constant and 


2/ 


and the code compiled depends on whether it is full-word or 
part-word. If a field appears without tailing, it is 
treated as an integer whose value is the field descriptor 

if the field had a form, and the contents of the location 
allocated to it otherwise. If a field is SIGNED, the top 
bit will be copied into all the higher bit positions of a 
24-bit word when the field is used to fetch a datum. Other- 


wise, these bit positions (if any) will be filled with zeros. 
5.4 Attribute Modifiers 


The shapes and sizes of arrays and strings are specified by modifiers 
(dimensions, lengths and forms) which have already appeared 

in the syntax for attribute names. Throughout, the expressions 

must evaluate to constants at compile time. This 

means that all the operands must be constant. See “Constants” 

in Section 5.7 for a discussion of what operands are regarded as 


constant. 
5.4.1 Dimensions 


dimension = "[" expr $("," expr) 


[":" [expr] ["," expr]] "J"; 


Arrays of any dimensionality up to 7 are allowed. The first 
expression following the colon specifies the number of words 
allocated to each element of the array; this makes it easy 
to create tables with multi-word entries. The size of an 


element is limited to 64 words. If it is not specified, it 
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is taken to be the size of the euaiey object with the same 
attributes as the declared array. If an array is given an 
element size different from the one implied by its type, | 

then subscripting it yields an expression of type UNKNOWN. 


See "Expression" (Section 6.1) for the implications of this. 


The second expression following the colon tells where to 
allocate the first word of the array. If it is absent, 
the array is allocated using standard policies described 


in Section 4 "Allocation." 


5.4.2 Length 
length = men expr be [expr ] eas expr] ] so : 


The string length is specified in bytes by the first expression. 
The second expression gives the byte size, chosen from 
6, 8, 12, and 24; 8 is the default value. The third expression 


tells where to allocate the first word of the string. 


If an array or string lacks dimension or length, no space 

is allocated and no descriptor created by the compiler. In 
this case an array is assumed to take one subscript. 

If these elements are present, space is assigned to the local 
environment if the scope is LOCAL and the descriptors are 
initialized at function entry. If the scope is COMMON , 

space 1s assigned in the proper common block and descriptors 


compiled into this block. See Section 4 on "Allocation" for details. 
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5.4.3 Form 


form = "(" word:displacement [ ":" 


starting:bit "," ending:bit] 
ee : 


word:displace- 
ment = expr; 


starting: bit expr; 


ending:bit expr; 
A form specifies the word displacement and left- and right- 
most bits of a field. If the bit numbers are omitted, Q and 


23 are used. A field may not cross a word boundary. 


5.5 Equivalence 


An equivalence has the following meaning: the identifier 
following the "=", called the object, must be previously aeclared; 
if subscripts appear, 1t must be a dimensioned array 

and the number of subscripts must match the number of iemensione: 
The effect is to assign the same storage to the identifier 
preceding the “=", called tne subject, as has already 

been assigned to the object. If the identifier in the object is 


L' or G', the subject is assigned to the designated location in 


the local or global environment respectively. If the subject is 
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a dimensioned array or a string, its descriptor is assigned to 
the same location as the object (to allocate the storage for 
array or string values, see ove: otherwise the subject 
itself is assigned to the same location as the object. No 
account is taken of the possibility that the subject may 
occupy more space than has been allocated for the object. 


For some details and restrictions, see "Allocation" (Section 4). 
5.6 Initialization 


Initialization of SCALARs has the following meaning: if no 
equivalence is present, the identifier being declared becomes 
Synonymous with the initialization quantity. For INTEGERS 
which lie in [ -2090B , 1777B], no space is allocated; for all 
other types, and for INTEGERs outside this range, space is 
allocated to hold the constant value in RSGS (for COMMON 
blocks) or CS (for PROGRAMS). If an equivalence appears, the 
object must be an absolute location (see "Allocation"), a 
scalar or array element with scope = COMMON, or an element 

of an initialized local array, and the initialization 


quantity will be stored into the variable, wherever it may be. 


For each type of SCALAR, the initialization quantity must be 

a constant se that type. A LONG or a LONGLONG may be 
initialized with a string constant or with a list of integers; 
this is the only way of introducing constants of these types 
into an SPL program. An initialized STRING must not have 


a length. 
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Numeric initialized variables, if not equivalenced, may be 
re-initialized. This is primarily useful for things like 
defining fields, etc., using a compile-time counter. If 
block A includes block B, re-initialization by a declaration 
in A of a variable acquired from B has no effect on B 


or any other block that includes B. 


Initialization of FUNCTIONs is done with a single name; 
otherwise the comments above apply. Initialization of 
FIELDS is illegal; the way to do this is to specify the 


form explicitly. 


An ARRAY is initialized with a list of constants of the 
appropriate type. (Elements of the list are separated by 
commas and the list is enclosed in parentheses, as usual.) 

A FIELD ARRAY may be initialized with constant FIELD SCALARs ; 
an ARRAY ARRAY may be initialized with names of arrays which 
have been declared with dimensions. The elements of the list 
go into successive elements of the array, starting with the 
first one. For multi-dimensional arrays, the last subscript 


varies most rapidly, just as the array is actually stored. 
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A special feature allows initialization, at a later time, 
of further elements of an array some of whose elements 
have already been initialized. If X is a declared, 


initialized array, then the appearance of 
X subscript: list initialization 


as a declare:clause will cause the expression(s) in the 
initialization to be stored into elements of the array 
starting at the one designated by the subscript: list. Of 


course, all the subscripts must be constant. 


5./ Constants 


For each type there is a syntax for constants representing 


values of this type. 


5.7.1 Integer Constants 


integer:constant = simple:integer / character :constant; 


simple: integer digit $(digit) ["B" [digit]]; 

If B appears, it causes the digits to be interpreted as octal; 
otherwise they are taken to be decimal. If a digit n follows — 
the B, it is a scale factor, i.e., it is equivalent to n zeros” 


preceding the B. 
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character:constant = ("6'" $4(pseudo:character) / 
("8'" 7 "'") $3 (pseudo:character) ) 
pseudo: character = <character other than & or '>/ 


"&" Jetter/ "8&" / N&Ih / tant? 

"&" 3$3 digits 
A character constant allows up to three 8-bit or four 6-bit 
characters to be right-justified in a 24-bit word to make an 
integer. Pseudo:characters permit quotes and control characters 
to appear; the latter are specified by the letter whose 


code is less by 1@@B. 


5.7.2 Real Constants 


simple:real:constant [exponent] / 


real:constant 


digits exponent; 


simple:real:constant diqits: *." Sldigit) oy. digits 


exponent 7 = "E" sign digits; 
sign = ue ee 
digits = 1$(digit); 


The meaning of this should be obvious; the given decimal 
approximation to a real number is converted to the closest 


approximation possible in the machine's 48-bit binary representation. 
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5.7.3 Double Constants 


double:constant = (simple:real:constant / digits) "D" 


[ "+" / veld digits; 
In this case the machine's 96-bit binary representation is 


‘used. Note that D must appear in a double constant, and 


that either . or E must appear in a real constant. 
5.7.4 Imaginary Constants 
imaginary:constant = (real:constant / digits) "I"; 
Complex constants may be spasiniete: by arithmetic on pasaeconsiants 


and imaginary:constants; such arithmetic is performed at compile time, 


resulting in a single complex constant in the object code. 
9./.5 String Constants 
string:constant = ('6"' / '8"' / '"') $(pseudo:character) '"'; 


The value is a string with the specified sequence of characters 


encoded in 8-bit (default case) or 6-bit bytes as specified. 
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9.7.6 Label Constants 
label:constant = identifier; 


The identifier must not appear in a DECLARE statement; it must 
appear as a label exactly once in the function, i.e., at the 


beginning of a statement and followed by a colon. 


5././ Constant Expressions 


The compiler will evaluate any expression consisting entirely 
of constants and standard functions and thus will treat it 


like a single constant. 
5.8 Data Formats 


The formats of the various kinds of values (i.e. the nay. 
representations) are in great part determined by the hardware 
of the machine. We summarize them here for Soindeteness: 

and to Snecity a few conventions established by SPL. Refer 


to the CPU manual for the exact word layouts. 


Integers are 24-bit two's complement. 

Longs are 48-bit quantities. No operations are defined 
on them except general ones for moving and econo 
any data object. 

Longlongs are 96-bit, but otherwise identical to longs. 


Reals are 48-bit: sign, Il1l-bit exponent and 36-bit fraction. 
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Double precision real numbers are 96-bit; the format is | 
identical to that for reals, except that the fraction 
is 84-bit. | | 
Complex numbers are 96-bit and consist of two reals. 
The real part is the first, the imaginary, the second. 
Strings are four-word (96-bit) objects called descriptors. 


Each word is a hardware string indirect address word: 


Bits Function 
0-1 =2, to specify a string descriptor 
2-3. ~—s byte size: Q=6-bit, 1=8-bit, 2=12-bit, 
3=24-bit 
4-5 byte number in word, counting from left | 


6-23 ~word address 


The four words are used as follows: 


Q: start of string, points to byte before first byte of string 
Le reader pointer, points to last byte read a 

2: ~~ ~writer pointer, points to last byte weeten 

3: end of string, points to last byte available in string 


Labels use the hardware's BLL descriptor, which is too 
comp lex to be described here. Functions are represented by 


pointers to their BLL descriptors. 
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Fields use the hardware's field descriptor, which is a one word 


object with the following form: 


0-| =], to specify a field descriptor 
2 set ror SIGNED field 

3-7 length in bits 

8-12 bit address of first bit 

13-23 word displacement 


Arrays use the hardware's array descriptor, which is a two-word 


object with the following form: 


G9: Q-] =3, to specify an array descriptor 
Q: lower bound (9% or 1) on subscript 
0:3 set for marginal index descriptors (see below) 
9: 4 large element bit | 
9 : 5-6 or 
5-10 multiplier (element size) 
Q : 7-23 or 
11-23 upper bound on subscript 
1 : 6-23 address of first word of array 


Array of dimension >1 are handled by marginal indexing; 


see the discussion of arrays in Section 6.1.5. 


Intrinsic functions will exist to decompose and construct 


all these descriptors. 
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5.9 Function Declarations 


The syntax is 


function:statement = 


ftype = 


function: location = 


For example 


ftype ("FUNCTION"/ "ENTRY") 

identifier "(" [declare:clause: 

fist] “) 1%,” FRETURN" J Painctvons 
location]; | 
ntype / "STRING"; 

Me MONITOR 7 “UTIL 7 “RORY 7 
"“TRAP'ENTRY" / "FTRAP'ENTRY" / 
"SP'ENTRY" / "SYSPOP") ['«" expression]; 


FUNCTION F (I, REAL J, STRING ARRAY K); 


ENTRY and FUNCTION are synonyms. 


5.9.1 Formal Parameters 


The declare:clause: list must not include lengths or forms. 


It may include dimensions, but only the number of subscripts | 


is counted, not the values, and the subscripts may be null 


(e.g. AL,] for a matrix). 


Arrays are assumed to have one 


subscript if no dimension appears. 


Any identifiers in the declare:clause:list which have not 


already been declared are declared as though they had appeared 


in a DECLARE statement with the same attributes. If any such 


identifier has already been used, an error comment results. 


For each identifier which 


has already been declared either: 
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1) the attributes specified for it in the function 
declaration must exactly agree with the attributes 


already declared for it, or 


2) no attribute specifiers may precede it in the 


declare:clause: list. 
Otherwise there will be an error comment. 


The identifiers in the declare:clause: list constitute the 
formal arguments in the order in which they are written. 
When the function is called (see "function calls" below) 

an equal number of actual parameters must be supplied, and 
they must agree in type and mode. No automatic conversions 


are done. The agreement is checked when the call occurs. 
5.9.2 FRETURNs 


The FRETURN clause must be included if the function returns 
with FRETURN. In this case it must always be called with 
a failure clause. If any function in a program block has a 


FRETURN, the first one must. 


5.9.3 Special Entry Points 


The function: location specifies that the function is to be 
entered in one of the system-defined transfer vectors at 


the location specified by the expression. In the case of 
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POP, SPL will supply a location if none is specified. The 


possibilities are: 


POP 


TRAP' ENTRY 


SP ' ENTRY 


the function is to be callable as a POP 


the function is to be called when the 
specified (ring-dependent) hardware 


trap occurs. 


the function is to be called when the 
sSub-process in which it runs is entered 


at the specified entry point. 


The remaining ones are of interest only to system programmers : 


FTRAP'ENTRY the function is to be called when the 


MONITOR 


UTILITY 


specified (fixed) trap occurs. 


the function is to be called when the 


— specified MCALL is executed. 


These two make sense only if the function 


is in the monitor. 


the function i to be called when the 
specified UCALL is executed. This makes 
sense only if the function is ina utility. 
If any function in a program block has a 
MONITOR or UTILITY function: location, the 


first one must have it. 


SYSPOP 
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the function is to be called when the 


specified syspop is executed. 


The following tables summarize the treatment of the various 


Special kinds of entry points. 


Type of function 


Ordinary 
MONITOR 
UTILITY 
POP 

TRAP' ENTRY 


FTRAP ' ENTRY 


SYSPOP 


SP' ENTRY 


Call with Return with Put descriptor 


BLL BLL -- 

MCALL BLL MCALL Transfer Vector 
UCALL BLL UCALL TV 

Pop BLL POP TV 


Not applicable. A trap'‘entry is not really 
a function. It does not have arguments. 
The address of the first word of code 
Should be put into the TRAP TV. It is a 
programming error to reference any local 
variables or do a return. 


As for TRAP'ENTRY, but put the address of 
the first word into the FTRAP TV. 


As for TRAP'ENTRY, but put the address of 
the first word of code into the TRAP TV 
at 20B + syspop number. 


SP* CALL SP "RETURN SP oy 


A sp'entry is not really a function. It does not 

have arguments. It is a programming error to reference 
any local variables. The only proper way to call 

an sp'entry is with an sp'call and to return with 

an sp'return. 


Table 5.1 Summary of Function Call Conventions 
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: Location and contents | 
Name of TV of descriptor Contents of TV entry 


MCALL 694000B; UB=MAXMCALL Absolute address of 


function descriptor. 
Initialized to an 
error function. 


UCALL 493014B; UB=MAXUCALL As for MCALL. 

POP G'[@]; UB=MAXPOP As for MCALL. 

TRAP G'{6]; UB=12B except Absolute address of 
for user ring, where code. Initialized 
UB=20B+MAXSYSPOP to a trap routine. 

FTRAP 694902B; UB=14B As for TRAP 

SP G'[12B]; UB=MAXSP As for MCALL 


All descriptors are normal IAWs with indirect addressing; they 
point to ARRAY IAWs with LB=@ (LB=1 for TRAPS), MULT=1, BASE=indexed indirect 
Source-relative pointer to the transfer vector, which is 


allocated in code space at the discretion of the compiler. 
The MAX symbols are, for the moment, built into the compiler 


with the following values: 


MCALL = 400B, UCALL = 400B, POP = 100B, SYSPOP = 100B, SP = 20B. 


Table 5.2: Transfer Vectors 


5.10 Macros 
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The language allows a simple form of token-substitution 


macro. A macro is defined by a 


macro:statement 


macro:name 
formal: list 
formal 
macro: body 


compact: token: 
string 


"MACRO" macro:name ["(" formal: list nyny 


ne 


macro:body ; 


jdentifier ; 


[formal $("," formal)]  ; 


identifier ; 


compact: token:string; 


<arbitrary string of tokens not including ";"> 


Once a macro:name has been defined (i.e. has appeared in a 


macro:statement) it can only be used in a macro:call. A 


macro:call may appear anywhere except in a string or character 


constant. It is 
macro:cal | 
actual: list 
actual | 


balanced: token: 
string 


macro:name ["(" actual:list ") "] ; 


[actual$("," actual) | 


balanced: token:string 


<compact: token:string balanced with 


respect to parentheses, and not including 


boo 
3 


except in parentheses, or carriage return>; 
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The actual: list must be present if and only if the formal:list 
was present in the macro:statement, and must be of the same 
length as the formal:list. The macro:call is replaced by the 
macro:body, except that each occurrence of a formal in the 
macro:body is replaced by the corresponding actual. The 


result is then scanned again for further macros. 


Macros in a macro:body are expanded at definition time (unless 
they have not yet been defined, . which case they are expanded 
at call time according to the rescanning rule stated above). 

If expanded at call time, their actuals must not include 


any formals. 


Note that a macro is expanded strictly by token substitution: 
there 1S no requirement that any of the token strings involved 


make syntactic or semantic sense. 
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6. Action:Statements 


An action:statement is defined by: 

action:statement = expression/ 
if:statement/elseif:statement/endif:statement/ 
for:statement/endfor:statement/ 


assembler:statement/ 


6.1 Expressions 


This section provides the following information about SPL 


expressions: 


approximate syntax, based on the precedence of the 
operators 


exact syntax 
rules for types of operands 


the semantics of the various operators 


6.1 


4 
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Precedence of Operators 


Expressions are made up of operators and operands. The operators, 


in order from low to high precedence, are: 


FOR WHILE 

IF ELSE 

WHERE 

& 

RETURN FRETURN 
OR 

AND 

NOT (unary) 
=#> >= < <= 
« (on right) 
MOD 

+ - Vi = 


* / LSH RSH LCY 
RCY A" 

ek 

+ - N' (unary) 

GOTO 

«+ (on left) 

. $ @ 

$ @ (unary) 

[] () 


loops 

candietonale: 

sequential evaluation 

sequential evaluation 

function returns 

boolean "or" 

boolean “and" 

boolean "not" 

relations 

assignment 

modulo or remainder 

add, subtract, logical or, logical 
exclusive or 

multiply, divide, left shift, right shift, 


left cycle, right cycle, logical and 


-exponentiate 


unary + -, logical not 
transfer 

assignment 

field operations 
indirection, reference 


Subscripting, function call 
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The operands are: 


constants 
names 


parenthesized expressions 


6.1.2 Syntax of Expressions 


The section 6.1.1 Operators by Precedence List, while convenient 
for quick reference, does not suffice to specify the syntax of 
expressions. We therefore state the complete syntax; explanations 


of the meaning of the operators follow in section 6.1.3: 


expression = forexp; 
forexp = ifexp $( "FOR" forclause / "WHILE" ifexp); 
forclause = identifier "«" remainder (["," 


alternation] "WHILE" ifexp / ["BY" 
ifexp] ["TO" ifexp]) ; 


ifexp = whrexp ["IF" whrexp ["ELSE" ifexp]]; 
whrexp = catexp [ "WHERE" whrexp]; 

catexp = retexp $("&" retexp), 

retexp = alternation / ("RETURN" / “FRETURN") 


(alternation youn ifexp $("," ifexp) 
a a i 
alternation = conjunction $("0R" conjunction) / 


"GOTO" tailing; 


conjunction = negation $("AND" negation) ; 
negation = ["NOT"] relation; 
relation = assignment [("=" Pea ah ee eee a 


n<" 7 "<s") assignment]; 
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remainder = sum $( "MOD" sum) ; 
assignment = remainder / a:tailing "«" assignment; 
sum = term $(( +" / nou j nye y NE ry term) ; 


term ae factor $(('" / ee / "TSH! / URSH! / 
| MEG J UROL fA): FAClOr) s | 


factor = [ "+" / wou u | a power; 


power = tailingL "**" factor]; 
tailing = a:tailing / v:itailing ; 
a: tailing = indirection $(tail) / reference 


$("." field) ; 


Vitailing = reference $(tail) : 

tail = (7 "§" 7 8") Field 5 
indirection = 1$("$") arrayref ; 

reference = ["@"] arrayref / function:cal] : 
arrayref = ai:primary $( "[" expression Sg ae 


expression) "|" ) ; 


function:cal | = a:primary / v:primary 4 


a:primary | = jdentifier / "(" a:tailing ")" ; 
Viprimary : = constant / "(" expression ")" ; 


Note: this grammar is ambiguous because a function:call can be parsed as 


both a:primary and v:primary. a:primary parsing will be used if possible. 
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6.1.3 Types of Operands 


The various operations have various requirements for the 
types of operands permitted and the type of result produced. 
The permitted combinations are summarized in Table 6.1, in 


which the following conventions are used. 


type abbreviations: I integer 
G long or longlong 
R real 
D double 
C complex 
S string 
L label 
U unknown 
other abbreviations: F suffix means mode = FIELD 
T suffix means mode = FUNCTION 
Y suffix means mode = ARRAY 
S suffix means mode = SCALAR 


A means any type 

N means I, R, D or C (i.e. number) 

M means I, R or D 
Where A, N or M is suffixed with a digit, different digits indicate 
that different types may appear. See, for example, "WHERE": 
argument 1 and argument 2 may be different types. If the digits are 
the same, or there is no digit, the types must be the same. 
Note: A partial ordering on the numeric types is defined: [<r<b, 


R<C. Where two Ns or Ms appear, the lower is converted to the higher 
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before the operation is evaluated. If the result is N or M, 

it has the higher type also. See, for example, "+". If NS] 

were "I" (integer) and NS2 were "R" (real), then NS1 would be 
converted to "R" and the result would be "R". It is illegal to have 
one D argument and one C argument. Where A appears, the mode 

is free except as fixed by suffixes. In all other cases mode 


= SCALAR. 


Constants receive special treatment. Any type N constant jis 
automatically converted to a higher type if that is required 
for an assignment to be legal. This is not done for variables; 
the explicit transfer functions described in the definition of 


action:statement at the start of Section 6 must be used. 


An object of type U may be used where A appears in Table 6.1. 
It may also be used as one of the operands in 
the lines marked *, in which case it is assumed to have the 


type of the other. 


Note the treatment of ARRAYs, FIELDs and FUNCTIONs of type 

ARRAY, FIELD or FUNCTION. When such variables are applied 

to subscripts, pointers or function arguments, they yield results 
of type UNKNOWN and mode given by their type. Normally 

such results must be assigned to Somiee nana of known type 

before they can be used because of the restrictions on the use — 
of type UNKNOWN; thus, for example, if we want A to be an 

ARRAY of REAL. FUNCTIONS we would write 


5 | 
DECLARE FUNCTION ARRAY A, REAL FUNCTION RA; 


RA + ALT]; 
RA(X,Y + 5); 
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ARGI opt ARG2 RESULT NOTES 
A OIF I ELSE A A The A's. are required 
| | to be the same only 
Al WHERE A2 Al if the value of the 
| : | IF 1s used.) | 
Al & A2 A2 
e1S OR BS IS 
*TS AND LS IS 
- NOT bS IS 
*NS1,AS = 7 NS2,AS 3 [S 
*MS1 ae MS2 IS 
x s A A 
*MS 1] MOD MS2 MS 
*NS] +,-,*, NS2 NS 
*NS] * NS2 NS but see details below 
*IS SALE Tes IS Is 
Age 
- N° 1S TS 
- +,- NS NS 
- GOTO LS - 
IS AF AS** 
AS $ IF IS 
IS @ iF IS 
: $ 1S US 
- @ A IS 
AY Siena IS] Aoe* 
IS [ Is ] | US 
AT (A2, ..., An) , AS** 


*; one operand may be U, and is assumed to have the type of the other. 


*k: if A is ARRAY, FIELD or FUNCTION, the result is type U, mode fis 


Table 6.1 Permitted Operands and Type of Result 
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6.1.4 Semantics of Expressions 


We now complete the discussion of operations with comments 

on the evaluation of each one, together with some remarks 
which may clarify the syntax and type conversion rules given 
above. The operands are referenced by the symbols which stand 


for them in the expression schemata on the left. 


FOR,WHILE are discussed in Section 6.2.2 

Al IF I ELSE A2 evaluates I. If I # 9, evaluates 
Al and returns its value, otherwise 
evaluates A2 and returns its value, 
or returns @ if the ELSE is missing. 
Typical usage is: 


F(X) IF X < 4 ELSE G(X) IF X< 5 ELSE H(X); 


Note that: 
Keery, Try. = 3--ELSE. Yel; 


alters X only if Y < 3. Therefore 
write: 


X «+ (Y IF X < 3 ELSE ale mr 
if this is intended. 


Al WHERE A2 evaluates A2, then evaluates Al 
and returns its value. 
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Al & A2 | evaluates Al, then evaluates A2 
and returns its value. Several &'s | 
may be strung together. 


RETURN, FRETURN See Section 6.1.6 


I1 OR I2 evaluates I1, returns 1 if it 
| is # @. Otherwise evaluates I[2, 
returns | if it is # @, otherwise 
returns @. | 


11 AND 12 evaluates 11, returns @ if it is 
=$. Otherwise evaluates 12, returns 
9 if it is = @, otherwise returns 1. 


NOT I evaluates I and returns | if I = 
$9, otherwise returns @. 


Al (=,#,>,>=,<3<=)A2 *evaluates Al and A2 and then 
evaluates the relation. The value 
is @ if the relation does not hold, 
1 if it does. Note that only = 
and # are legal on non-M types. 


Al < A2 evaluates A2 and stores the resulting 
value into Al. They must agree in type 
and mode except for the special treatment 
of constants, and except that one operand 
may be of type U. 


Ml MOD M2 *evaluates Ml and M2, and returns 
M1-FIX(M1/M2)*M2 

N1(+,-5,*5/) N2 *obvious 

LTCAS SV" AE" 12 --*compute the bitwise "and," “or," or 


"exclusive-or' of the operands 


I1(LSH,RSH,LCY,RCY) I2 *these are 24-bit logical shifts 
(shift in @s) or cycles 


* see end of table. 


N] ** N2 


A $ IF 


1 @ IF 


* see end of table. 
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*obvious, except that 

I] **I12 is an error unless I2 is 
positive. The error is not caught 
until runtime if I2 1s not a constant. 


obvious. 


computes the bitwise (1's) complement 
of I. , 


sends control to the statement 

labeled by L. If this was passed 

as a parameter, the correct environment 
is restored. 


*evaluates I, takes it as an 
absolute address A, and references 
the bits of A + word:displacement 
(AF), from starting:bit (AF) to 
ending:bit (AF). The result may 
appear on either side of an 
assignment. If the field is 
SIGNED, the starting bit is copied 
into all the higher bit positions 
when the result is used as a 
value; otherwise these positions 
are filled with zeros. 


*references the bits of the value of 
A specified by IF. The word 
displacement of IF should not be 
greater than the number of words 

in the value of A. (4 at most if 

A is a variable). Sign extension 

is handled as for "." above. 


*returns T, where T is the result 
of: 


T<@ 
Sirs 1 


j.e., the value of | positioned 
in a word according to the field 
eae 


$I 


GA 


Al(A2,...,A2) 


AF(T) 
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references the value addressed by 
the value of I taken as a hardware 
indirect word. Normally the top 

6 bits of I should be off, since 
the hardware uses them to select 
the type of indirection rather 


than to specify the address. 


The value is of type U. 


returns the address of the value 

of A. It makes sense (and is 

legal) only if A can appear on 

the left of « or is a label constant. 


references the element of the array 

A specified by the subscript I, as 
described in Section 6.1.5. If 

the first operand is IS, only | 
subscript is allowed. This construct 
is equivalent to (IS + I).WO, where 
we have declared FIELD W@(@). 


returns the value of the function 
Al after calling it with parameters 
A2, as described in Section 6.1.6. © 


— equivalent to I.AF 


An * preceding the description means that the order of 


evaluation of simple operands (see Section 6.1.6.1) is 


undefined. Compound operands are always evaluated left-to-right 


if there are more than one. 


If the * is lacking, the 


operands are always evaluated left-to-right. 


oy) 


6.1.5 Array expressions 


Arrays of any dimensionality from | to 7 are allowed. If 
the array has n dimensions, then a reference to it with n 
integer subscripts yields a scalar of the same type, unless 
the type is ARRAY, FIELD or FUNCTION. In this case the type 
of the result is UNKNOWN and its mode is the type of the 
array. Thus, after declaring: 

1) INTEGER I, J, K, REAL ARRAY AL3, 4, 5] 

we know that: 

2) A(T, Jt], K**2] 

is a REAL SCALAR. It is also possible to write: 

3) ALI, J+1] 

which is an UNKNOWN ARRAY. If it is assigned to the REAL 
ARRAY B, then: 

4) B [K**2] 

references the same scalar referenced by (2). It is probably 
not useful to do anything with an UNKNOWN array except to 


assign it to something. 


Marginal indexing is used to access arrays. In the above example (1), 
the value of A is a descriptor for an array with three 

entries. Each entry of this array is a descriptor for an array 

with four eas Each entry of the four-entry array is a descriptor 
for an array with five entries, each of which jis a real number. 

The Figure 6.1.5 illustrates this. The 120 words allocated for the real | 
numbers are contiguous in storage and in the order indicated. 


Note that Fortran arrays vary the first subscript first 


and are therefore incompatible. _ 
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FIGURE 6.1.5 
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6.1.6 Function call and return expressions 


Tne syntax for a function call is 
a:primary "(" [expr $("," expr)] [stores] 
["//" failure:result [stores]]")" ; 
stores = ":" identifier $("," identifier) ; 
failure:result = ["GOTO"] identifier / ("RETURN" / 
"FRETURN") [expr / expr:list] / "VALUE" expr ; 


The a:primary must have mode=FUNCTION. The value of the © 
function is taken to be a SCALAR of type equal to the type 


of the a:primary, unless this type is ARRAY, FIELD or 


FUNCTION. In the latter case, the type of the result is UNKNOWN 


and its mode is given by the type of the function. 


6.1.6.1 Arguments 


The arguments immediately follow the function name. There 

1S no restriction on their number or type, except that an 

initialized LOCAL label or string array may not be used. 
F()s F(X); F(X,¥(1,2),Z03]**5 W,Q) s 

are function calls with O, | and 5 arguments respectively. 

Arguments are evaluated as follows. All the arguments which | 

are compound are evaluated, left to right, and their values 


are saved. An argument is simple if it is one of the 
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following, compound otherwise: 


constant 
identifier 


identifier "[" identifier "]" 
identifier "." field 
"¢" jdentifier 


ug identifier La full-word field 


The value of each argument is then stored in the corresponding 
formal argument of the function being called. No type 
conversion is done; nonmatched types are a run-time error. 


Control is then passed to the function. 
6.1.6.2 Returns 


A return is done with an expression of the form: 

RETURN (expr, expr,...; expr) / 

RETURN expr/ — 

RETURN ; 
The expression list is treated exactly like the actual 
parameter list of , function call. The value of the first 
expression becomes the value of the function; it and subsequent 
values are stored in the corresponding identifiers 
following the ":" in the call, exactly as actual parameter 


values are stored in formal parameters. 
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6.1.6.3 Failure Exits 


If a failure exit is provided following the "//" in the call, 

an FRETURN will send control as specified in the failure exit. 

It may be a label, in which exe6. conten goes to that address; a 
RETURN, in which case a return is made from the function containing 

the failure exit; or a VALUE expression, in which case the value of the 
expression becomes the value of the function. Just as for the 

RETURN, any number of values may be returned; they are stored 

in the corresponding local:identifiers following the ":". 

When a function has a failure exit the normal or success return 


is with RETURN, exactly as for a function with no error exit. 


6.2 Expressions used as Statements 


In order to catch some common errors in which the user inadvertently 
writes an expression statement which does nothing, 

a set of rules is enforced. They insure that evaluation of 

the expression results in some change in the state of the 

world; such an expression is called an action expression. 

Here the principal operator is the one of lowest precedence 

(i.e. first on the list in the Section 6.1.1 on "Precedence of 
Operators"), except that any operator enclosed inn sets of 
parentheses is of higher precedence than any operator enclosed 


in fewer than n sets of parentheses. 


62 
An expression is an action expression if the principal operator is: 
a) +«, GOTO, RETURN, FRETURN, or a function call 
b) WHERE, & FOR, WHILE, IF 
If the operator is in group (b) then: 


a) for "WHERE" or "&", both operands are action 


expressions, 


b) for "FOR" or "WHILE", the body (first operand) 


is an action expression, 


c) for "IF/ELSE", both of the consequents are 
action expressions, or the second consequent 


1S missing. 
6.2.1 IF statements 
We have seen that IF can be used as an operator. It can 
also be used in the following way: 


IF expression DQ; 


ELSEIF expression DO; 
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ELSE DO; 


ENDIF ; 
Any number of ELSEIFs are allowed. The "..." may be replaced 
by any sequence of statements balanced with respect to IFs 
and FORs. The ELSE may be omitted. The integer expressions after 
the IF and ELSEIFs are evaluated in turn until a non-zero one is 
found. The statements between it and the next ELSEIF, ELSE or ENDIF 
are then executed, and control goes to the statement following the 
ENDIF. THE ELSE DO is equivalent to ELSEIF 1 D0. If none of 
the expressions are non-zero, nothing is done. It is good 
practice to indent the statements represented by "..." uniformly 


3 or 4 spaces for clarity. 
6.2.2 FOR statements 


The same thing can be done with "FOR" as with "IF" 


for:statement; 


ENDFOR; 
Here we have 
for:statement = ("FOR" for:clause / "WHILE" expression)"DO"; 
for:clause = identifier "«" (expressionl[ "BY" expression2] 
["TO" expression3] / expressionl["," expression2] 


WHILE" expression3) ; 
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If the BY/TO form is used, the identifier must be of type M. 
If BY is omitted, BY 1 is assumed. If TO is omitted the loop 


can only terminate by an explicit transfer out. 


The effect is that oe statements ebpnecenuer oy ... are executed 
repeatedly for successive values of the controlled variable. 

In the first case the variable starts at expression]. 

On each successive loop expression2 is added until the 

variable passes beyond expression3. The definition of 

"beyond" depends on the sign of expression2. If expression] 

is beyond expression3, the loop body will not be | 
executed at all. If the expressions are compound (see Section 6.1.6 on 
"Function Calls"), they are evaluated before the loop Starts ; 


if simple, then each time around. — 


The second form initializes the controlled variable for 
expression]. Then it tests integer expression3. If it is 
%, control passes beyond the ENDFOR. Otherwise the loop 
body is executed, the value of expression2 (or expression] 
if expression2 is omitted) is assigned to the variable, and 
the test is made again. The expressions are re-evaluated 
each time around the loop. 

A WHILE statement simply loops until the integer expression. 


is @, without modifying anything. 


When "FOR" or "WHILE" is used as an operator, exactly the same 
facilities are provided. The first argument is evaluated 


each time around the loop. The value is undefined. Thus 
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ALI,J] « @ FOR I+] TO N FOR J<+1 TO M; 


would initialize the array A. 


6.3 Assembly Language 


An assembler:statement consists of one or more machine instructions 


according to the following syntax: 


assembler:statement = "." machine:instruction $('," 
["."] machine:instruction) ; 


machine: instruction = opcode [address]; 


opcode identifier / simple:integer; 


address expression; 

Since opcodes appear in a restricted context, the symbols used 
for opcodes in the CPU Manual (which are all recognized by SPL) 
may be used freely for other purposes as well. If an opcode 

is an identifier and not predefined, it must be an INTEGER 
constant. Such opcodes, as well as opcodes which are written 
as integers, are treated as follows: if no address appears, the 
value of the opcode is placed directly in the compiled program; 
if an address does appear, bits 18-23 of the opcode 

value are placed in bits 3-8 of the instruction word and bit 

17 of the value is placed in bit 9 (the programmed sperdter 


bit). 


Any expression may appear as an address as long as it is 
logically equivalent to either a constant (of any type and 


mode) or one of the addressing formats of the CPU. These 


66 


_ formats are described in detail in the CPU Manual and are listed 
below, together with the usual way of generating them. Note 


the existence of the four reserved symbols X', L', G', and R'. 


Addressing format Normal syntax 
D | G'[IN] 
I $G'[N] 
X | X'TNJ 
PD PIN] 
PDI SPN] 
BX ALT] 
BXD ($X")LI+N] 
IM | | ON 
IMX | X4N 
sR REN] 
SRI $R'[N] 
LR L' EN] 
LRI : | $L' CN] 


In the above list, N stands for an INTEGER constant quantity, 
P and I stand for INTEGER SCALAR quantities, and A stands for 
an ARRAY quantity. BX or bp adduessing may also result from 
tailing. Since the determination of the addressing format 

is done on semantic, not syntactic, grounds, the exact 


rules are quite complex. 
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7. Intrinsic Functions 
Figure 7.1 lists all the intrinsic functions in SPL. 


An intrinsic function is one which: 

1.) Is recognized by the compiler without the need 
for any declaration by the user; 

2.) May have default argument values automatically 
supplied by the compiler; 

3.) Has the types of its arguments checked at compile 
time ; 


4.) May compile into special open code. 


In figure7.1, default values for arguments which the user 
is allowed to omit are given in parentheses after the 
argument type. For all functions which have failure returns, 7 
routine which prints an error message and causes a sub- 
process trap will be supplied if the user fails to specify 


a failure action. 


The remainder of this section describes each intrinsic 
function in detail. Type letters with subscripts 

will be used to refer to the arguments of a function: 
e.g. the arguments of CNS will be referred to as I) sS513; 


and I). 


68 


een meneeeceecenneemanraenennaaenels 


NAME ARGUMENT TYPES RESULT TYPE FRETURN? = OPEN CODE? © 


FIX R | a’ | | | x 


ENTIER R 4 x 
FLOAT > R — YX 
DFLOAT I | D X 
RE C | R 
IM c OR 
CSN S.1(19) I X 
CSR S R X 
CSD S D X 
CNS TS 1( @ ) 1 (19) : x 
CRS R,S,1 (9) S X 
CDS D,S,1(P) S x 
INCDES rat I x 
LNGDES fed I | X 
GCI S ; . X X 
WCI IS I a. x 
GCD Ss i. | ‘x 
WCD 1,8 I _ a X 
SETUP $,1,1,1(8) b me: . 
I = integer C = complex 
R = real | | A = array © 
S = string D = double 


Figure 7.1 List of intrinsic functions 
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NAME ARGUMENT TYPES RESULT TYPE FRETURN? OPEN CODE? 


SETS S,1(9) 51 (9) S X 
SETR S,1() S X 
SETW S,1(9) S , 
LENGTH S I X 
SCOPY S58 a X 
APPEND SS S X 
GC S I X 
STORINIT i I X 
MAKE I ,1(@) I X 
SETZONE I I X 
SETARRAY A | I | X 
FREE 1,1(@) - X 
EXTZONE Fel - X 
FREEZONE _—— I, I() - X 
BCOPY lee) - | i 
BSET 1,I,I(-1) - X 

I = integer C = complex 

R = real A = array 

= double 


S = string D 


Figure 7.1 (continued) 
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7.1 Type Conversion Functions 


FIX(R, ) converts R, to an integer by truncation towards 
zero. 

ENTIER(R, ) converts R, to the nearest integer. 

FLOAT(I,) converts I, to single-precision floating point. 


DFLOAT(I,) converts I, to double-precision floating point. 


The four operators above are converted directly into machine 
instructions. For details consult the part of the CPU 
Manual which deals with handling of floating 


point numbers. 


RE(C, ) gives the real part of C, in Single-precision 


floating point. 


IM(C,) gives the imaginary part of C, in single-precision 


floating point. 


CSN(S, ,15//F) expects to find an integer as the beginning 


? 


2? 
Digits above 9 are allowed if 1,719: the next digit after 


of Sy> with syntax ['t' / '-'] 1$<digit in base I 


9 is A, and so on. I, is taken as 19 if not supplied. CSN 
returns the integer, which it reads off the string, advancing 
the reader pointer so that the next character to be read 


is the non-digit which ends the integer, or to 


7\ 


the end of the string. CSN fails if 34 does not begin 
with an integer in the proper format, leaving the reader 


pointer unchanged. 


CSR(S1//F) expects to find, at the beginning of S,, 
a real number in any of the formats allowed by SPL for REAL 
quantities. It returns a single-precision floating point 
number. Otherwise the action is the same as for CSN. 

CSD(S1//F) is the same as CSR except that it returns 
a double-precision floating point result. Either SPL 
REAL or DOUBLE syntax is acceptable; in the former case, 
the number is accumulated in double precision. 

CNS(T) 5S5 51451 4//F) converts the value of I, to a 
string of characters, which it appends to So: The radix 
is Ty, assumed to be 1% if omitted. If bit @ of l. is on, 
I, is converted unsigned (e.g. -2 will appear as 77777776 
in radix 8); otherwise, a '-' precedes the converted absolute 
value if I, is negative. Bits 18-23 of I, give the 
number of characters to generate: enough blanks are written 
before the converted value to give the total number of 
characters required. If the converted value does not fit 
into this many characters, it is truncated on the left with 
no error indication. If the sarneter count is 9, the converted 
value is neither padded nor truncated. I, is. taken as @ (sdaned: 
no formatting) if omitted. CNS fails only if there is insufficient 
room to write the necessary number of characters onto So! leaving 


the writer pointer is unaffected. 


72 


CRS(R, S5,14//F) appends the converted value of 
Ry to So: The failure condition is the same as for CNS. 
I, specifies the format in some as yet unspecified way; 
I, = 9 is assumed if 1, e omitted, results in some 
reasonable unformatted conversion. 

CDS(D,.S,13//F) is exactly like CRS except that the 


converted value is in DOUBLE rather than REAL format. 


7.2 String Functions 


In this section the following abbreviations are used: 

BP = beginning pointer, RP = reader pointer, WP = writer 
pointer, EP = end pointer. These correspond to the 4. 
words of an SPL string descriptor, in order. 


INCDES(I I,) assumes that I. is a character pointer 


1? ] 

(hardware string indirect word), such as one of the 4 
words in an SPL string descriptor. The value is I, 
incremented by I, character positions. See CPU Manual 


for the exact specification of this operation, which is 


done with the ASP instruction. 


LNGDES(T, 1.) assumes that I, and I, are both 
character pointers. It returns the length of the string 
which they bracket. See the CLS instruction in CPU Manual 


for details. 
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GCI(S,//F) fails if S, is empty, i.e. RP = WP. 
Otherwise it increments RP by one character position then 
returns the character pointed to by RP. 


WCI(I, ,S,//F) fails if S, is full, i.e. WP = EP. 


2 
Otherwise it increments WP then writes I, at the character position 
pointed to by WP. The value of WCI is I,. 

GCD(S,//F) fails if S, is empty. Otherwise it returns the character 


pointed to by WP and then decrements WP. 


WCD(I, S,//F) fails if So is initialized, i.e. BP = RP. 
Otherwise it writes I, at the character position pointed to by 
RP and then decrements RP. The value of WCD is I,. 

SETUP(S, 5155131,) puts into S, a string descriptor for 
a string of I, characters starting with the first character of 
the word pointed to by T,. The character size is Iq; defaulting to 
8. The value of SETUP is the string descriptor it creates. If I, 
is omitted, MAKE is called to assign space. BP = RP = WP in the 


created string descriptor. 


SETS(S, 51,13) is exactly equivalent to SETW(S, 51) 
followed by SETR(S,.1,): see below. I, and I, are taken 
as @ if omitted. 
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SETR(S, 51.) sets 51's RP to point I, characters beyond 


BP. IF I,<f, it is taken as Q; if I,>LNGDES(BP.WP), it is 


2 
taken as this quantity; if I, is omitted, it is taken as @. 


The effect is that the RP remains between the BP and the WP. 


SETW(S, I.) sets S,'s WP to point I, characters beyond 
BP. There are four cases: I,<9 leads to WP«RP<BP; Q<1, 
<LNGDES(BP,RP) leads to WP<RP+INCDES(BP 15) 3 LNGDES(BP ,RP) 
<I,<LNGDES(BP,EP) leads to WP<INCDES(BP,I,,); and I,>LNGDES 
(BP,EP) leads to WP<EP. Again, the effect is to guarantee 


the correct order of BP, RP, WP, and EP. 


LENGTH(S, ) gives the number of GCI's that can be done > 
on S, without failing, i.e. LNGDES(RP,WP). 


GC(S.) returns the character pointed to by RP. This 


1s garbage 1f 3) is empty, but no check is made. 


| SCOPY(S,,S,//F) copies the string S, into the string 


S,. S, is not affected; for S,, RP<BP and WP+INCDES 


1° 
(BP, LENGTH(S.)). Failure occurs only if there is not enough room 


in Sy and no pointers are affected. 


APPEND(S, 5S.) appends the string S5 to the string Sy. 
advancing Sy 's WP by LENGTH(S,). The failure condition is the 


same as for SCOPY. 
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7.3 Storage Allocation Functions 


There is a standard mechanism for allocating and releasing 
arbitrary-sized blocks of storage in arbitrary order called 
the storage allocator. It is driven by the following 


Standard functions: 


STORINIT (1, .1,) initializes the storage allocator 
to use an area of storage beginning at I, and occupying I, 
number of words for its machinations. It is not necessary 
to call STORINIT; a standard area will be reserved if 
STORINIT has not been called when the first request is made 
for a block. The value of STORINIT is a pointer to the zone 
just created; this pointer is also put into the predeclared 


global pointer variables INFINITY'ZONE and CURRENT'ZONE. 


MAKE(T, I.) creates a block of storage of I, words 
and returns a pointer to it. An extra cell is assigned by 
the system; it immediately precedes the block and contains 
the length in the bottom 18 bits and flags in the top 6. 
The cell is for system use only. Space is normally 
allocated directly from the area specified by STORINIT 
(or the standard default area); this area is called the 
infinity zone. The user may set up zones of his own; for 
example, if he wishes to create some fairly complex 


temporary structure and then delete it in its entirety, | 
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it is more efficient to create it ina separate zone 

and then release the entire zone. vr antennas optional, 

is a pointer to a user zone; if it is amieted ene zone 
pointed to by CURRENT'ZONE is used. CURRENT'ZONE is 

set by the function SETZONE(I,) which provides compatibility 
with the biaidware) storage allocator. A user zone is 


created by the function STORINIT. 


SETARRAY (A, ) which makes the space occupied by the 
array Ay into a new zone by setting up some machinery 
inside it. CURRENT'ZONE is not set by this function. 


Blocks are released by the function FREE. 


FREE(I, 51.) where the block pointed to by I, must 
fall within the zone optionally given by l,. When a 
zone is full, 1.e., a call on MAKE finds insufficient 
Space, an overflow function is executed. The address of 
the descriptor for this function is in word | of the zone; 
it is initialized to a system error routine when the zone is 
created. The user, of course, may change it at any time. 
The function receives the arguments of MAKE as its arguments. 
Frequently the proper course of action is to acquire additional 


Space and attach it to the zone: this is done by EXTZONE. 


EXTZONE(I,,1,) adds all the space in the block 


I, to the zone pointed to by I When a zone reaches the 


1" 
end of its usefulness, all the space occupied by the 


zone must be released using the function FREEZONE. 
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FREEZONE(T, 51.) releases all the space (including 
extra extensions) occupied by the zone I, into the zone 
I,. If the extensions were allocated out of more than 


one zone, the user must release them individually with FREE. 
7.4 Miscellaneous Functions 


BCOPY(I, 1,313) copies I, words starting at I, to 
I, words starting at [,. Copying is done in the appropriate 
direction (i.e. starting at the beginning or the end 
of the block) to ensure that no information is lost. If 
1, is omitted, 1,.SIZE is used, where FIELD SIZE (-1:6,23); 
this is where the storage allocator hides the block size. 
The intention is that I, Should be omitted if the block 
pointed to by I, was created with MAKE, since other objects 


in SPL such as arrays and strings do not have this word. 


BSET(I,,15;13) initializes I, words starting at L, 
to the value I,. If I, is omitted, I, .SIZE is used as in 
BCOPY. 
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Appendix I 
List of Keywords 


LSH © 


A’ MACRO 
AND MOD 
ARRAY MONITOR 
ARRAYONE 
| N! 
BY NOT 
CHARACTER OCTAL 
COMMON OR 
COMPLEX ORIGIN 
DECLARE PARAMETER 
DO POINTER 
~ DOUBLE POP 
PROGRAM 
F ) 
ELSE R' 
ELSEIF RCY 
END REAL 
ENDFOR RETURN 
ENDIF RSH 
ENTRY 
EXTERNAL ~ SCALAR 
SIGNED 
FIELD SP 'ENTRY 
FIXED STRING 
FOR SYSPOP 
FRETURN 
FTRAP 'ENTRY TO 
FUNCTION TRAP ' ENTRY 
G' UNKNOWN 
GOTO UTILITY 
IF ye 
INCLUDE VALUE 
INTEGER 
WHERE 
L' WHILE 
LABEL 
LCY x! 
LONG 
LONGLONG 


